Techniques for Highly Accurate Optical Recognition of Handwritten Characters and Their Application to Sixth Chinese National Population Census
نویسندگان
چکیده
Highly accurate optical character recognition (OCR) of handwritten characters is still a challenging task, especially for languages like Chinese and Japanese. To improve the accuracy, we developed four techniques for enhanced recognition: character recognition based on modified linear discriminant analysis (MLDA), subspace-based similar-character discrimination, multi-classifier combination, and mutual-information-based adaptive rejection. They were applied by the Chinese government to the Sixth National Population Census in 2010. By combining address and nationality information, they achieved an accuracy of over 99% with a low rejection rate. This was the first time that optical recognition of handwritten Chinese characters had been used on a large-scale in the Chinese census project.
منابع مشابه
Approaches to chinese character recognition
Work on the recognition of Chinese characters is reviewed. All of this work has been reported in just the last nine years, and most of it since 1970. The topics covered are: printed Chinese Character Recognition—Half a dozen groups have tackled this problem, using methods as varied as syntactic description, projection profiles, template matching, Fourier transforms, and hierarchical processing;...
متن کاملNeural Network Based Recognition System Integrating Feature Extraction and Classification for English Handwritten
Handwriting recognition has been one of the active and challenging research areas in the field of image processing and pattern recognition. It has numerous applications that includes, reading aid for blind, bank cheques and conversion of any hand written document into structural text form. Neural Network (NN) with its inherent learning ability offers promising solutions for handwritten characte...
متن کاملHindi Numeral Recognition using Neural Network
Handwriting has continued to persist as a means of communication and recording information in day-to-day life even with the introduction of new technologies. The constant development of computer tools lead to the requirement of easier interface between the man and the computer. Handwritten character recognition may for instance be applied to Zip-Code recognition, automatic printed form acquisit...
متن کاملDifferent Edge Detection Algorithms Comparison and Analysis on Handwritten Chinese Character Recognition
The acquired image will has different levels of noise pollution and image distortion in Handwritten Chinese character recognition document image processing. For these situations the accurate and fast edge detection method is an important prerequisite for the recognition results. The widely used edge detection algorithms such as: first derivative-based edge detection method, second derivative ed...
متن کاملTitle Consistent relaxation matching for handwritten Chinese character recognition
Due to the complexity in structure and the various distortions (translation, rotation, shifting, and deformation) in different writing styles of Handwritten Chinese Characters(HCCs), it is moresuitable to use astructural matching algorithm for computer recognition of HCC. Relaxation matching is a powerful technique which can tolerate considerable distortion. However, most relaxation techniques ...
متن کامل